Visual Semantics: Extracting Visual information from Text Accompanying Pictures
نویسندگان
چکیده
This research explores the interaction of textual and photographic information in document understanding. The problem of performing generalpurpose vision without a priori knowledge is di cult at best. The use of collateral information in scene understanding has been explored in computer vision systems that use scene context in the task of object identi cation. The work described here extends this notion by de ning visual semantics, a theory of systematically extracting picture-speci c information from text accompanying a photograph. Speci cally, this paper discusses the multi-stage processing of textual captions with the following objectives: (i) predicting which objects (implicitly or explicitly mentioned in the caption) are present in the picture and (ii) generating constraints useful in locating/identifying these objects. The implementation and use of a lexicon speci cally designed for the integration of linguistic and visual information is discussed. Finally, the research described here has been successfully incorporated into PICTION, a caption-based face identi cation system.
منابع مشابه
Visual Semantics for Reducing False Positives in Video Search
This research explores the interaction of textual and visual information in video indexing and searching. Much of the recent work has focused on machine learning techniques that learn from both text and image/video features, e.g. the text surrounding a photograph on a web page. This is useful in similarity search (i.e. searching by example), but has drawbacks when more semantic search is desire...
متن کاملComparative Approach to the Relationship Between Text and Hand Visual Language in Tahmasebi’s Shahnameh Pictures
The painters of Tahmasbi Shahnameh, in order to depict the text full of the story of Shahnameh, tried to convey emotions and excitement to the audience by using the visual language of the hand. Due to the multiplicity of applications of this type of nonverbal communication in different situations, the painter may have undergone changes in parts of her painting under the influence of various fac...
متن کاملLearning the Semantics of Words and Pictures
We present a statistical model for organizing image collections which integrates semantic information provided by associated text and visual information provided by image features. The model is very promising for information retrieval tasks such as database browsing and searching for images based on text and/or image features. Furthermore, since the model learns relationships between text and i...
متن کاملInterrogation of a University Classrooms in the Court of Semantics: Managerial Implications
The purpose of this article, within the framework of an interpretive study, was to study the semantics of a universitychr('39')s classrooms to create a critical awareness of the meanings of the symptoms and their functions at the context of physical artifacts, besides their managerial implications. To accomplish this goal, after taking pictures of the structural elements of the studied classroo...
متن کاملUsing Eye Movement Analysis to Study Auditory Effects on Visual Memory Recall
Recent studies in affective computing are focused on sensing human cognitive context using biosignals. In this study, electrooculography (EOG) was utilized to investigate memory recall accessibility via eye movement patterns. 12 subjects were participated in our experiment wherein pictures from four categories were presented. Each category contained nine pictures of which three were presented t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1994